Every document has a geographical scope

نویسندگان

  • Geoffrey Andogah
  • Gosse Bouma
  • John Nerbonne
چکیده

It is a useful premise to assume that every document in a collection and every query issued to an information retrieval (IR) system is geography-dependent. If one can determine what area an article is about (i.e., its’ geographical scope), this information can be used to improve the accuracy with which people, places and organizations named in the article can be located. More importantly, geographical scopes of documents may be exploited to improve the performance of IR systems against geography-dependent user queries by tuning relevance ranking and query expansion strategies with scope metadata. We want to answer the following pertinent questions to ascertain the usefulness of geographical information in improving retrieval accuracy: (1) how far can geographical information in queries and documents improve retrieval accuracy of IR systems when answering geography-dependent queries; and, (2) how effectively can geographical information in queries and documents be utilized to improve the quality of relevance ranking in geographical IR domain. This paper outlines strategies to determine the geographical scope of documents, and describes methods to utilize scope information to improve the performance of toponym resolution, relevance ranking and query expansion.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Assigning Geographical Scopes To Web Pages

Finding automatic ways of attaching geographical scopes to on-line resources, also called “geo-referencing” documents, is a challenging problem, getting increasing attention [1, 5, 3]. Here we present a system architecture and a process for identifying the geographical scope of Web pages, defining a scope as the region where more people than average would find that page relevant. We rely on typ...

متن کامل

خوشه‌بندی اسناد مبتنی بر آنتولوژی و رویکرد فازی

Data mining, also known as knowledge discovery in database, is the process to discover unknown knowledge from a large amount of data. Text mining is to apply data mining techniques to extract knowledge from unstructured text. Text clustering is one of important techniques of text mining, which is the unsupervised classification of similar documents into different groups. The most important step...

متن کامل

Experiments Adapting An Open-Domain Question Answering System Tothe Geographical Domain Using Scope-Based Resources

This paper describes an approach to adapt an existing multilingual Open-Domain Question Answering (ODQA) system for factoid questions to a Restricted Domain, the Geographical Domain. The adaptation of this ODQA system involved the modification of some components of our system such as: Question Processing, Passage Retrieval and Answer Extraction. The new system uses external resources like GNS G...

متن کامل

جستاری در شناخت نظری مفهوم جغرافیای فرهنگی در چارچوب مکتب سازه‌انگاری

In the human science, one concept may be having been some definition or even may be this narration was contradiction with each other in different philosophical schools. Therefore, explanation of one concept in different cognition schools has very great importance. Geographical Space consists of bilateral relation between human and environment. In other word, this dimension and its ideological s...

متن کامل

جستاری در شناخت بازتاب فضایی عملکرد بازیگران سیاسی در چارچوب مکتب پدیدارشناسی هرمنوتیک

Extended abstract Introduction   In the human science, one concept may be having some definition or even may be this narration was contradiction with each other in different philosophical schools. Therefore, explanation of one concept or relationship in different cognition schools has very great importance. From philosophical aspects in human science, theoretical structure has very fundamen...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Data Knowl. Eng.

دوره 81-82  شماره 

صفحات  -

تاریخ انتشار 2012